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The central problem of concern to Serrano, Bogufia and 
Vespignani |lj can be effectively and elegantly addressed using 
a well-established two-stage algorithm that has been applied 
to internal migration flows for numerous nations and several 
other forms of "transaction flow data" 

In the first stage, the A'^ row and A'^ column sums of 
the N X N matrix of weighted, directed network flows (fij) 
are alternately scaled to all equal 1, iteratively, until suf- 
flcient convergence to a doubly-stochastic table is attained. 
So doing serves as a control for multiscale effects. (This 
procedure-convergent under broad conditions [5]-leaves the 
cross-product ratios ^'^J" [measures of association] invariant, 

and provides maximum-entropy estimates given the doubly- 
stochastic constraint. One may also consider for highly sparse 
networks, a preliminary smoothing of the matrix entries, or 
an adjustment of row and column sums to be proportional, 
not to 1, but to the number of non-zero entries in the row or 
column. Powering a doubly-stochastic matrix yields another 
[smoother] doubly-stochastic matrix.) 

In the second stage of the algorithm, the nodes (ver- 
tices) of the network are hierarchically clustered. One starts 
with an A''-node graph with no links. Then, if the y'-entry of 
the doubly-standardized table is the greatest, a link is drawn 
from node i to node j, and so on, using the second, third... 
greatest links. Larger and larger strong components of the di- 
rected graph (digraph) emerge, until all the nodes are united 


of America 

in a single strong component (that is, there exists a path of 
directed links from any node to any other). This flnal digraph 
(with links labeled by the original flow values) is an obvi- 
ous candidate for the "multiscale backbone" of the complex 
weighted network. (In our recent study of migration between 
the 3,107 U. S. counties, this backbone consisted of 25,329 
links d]. In the methodology of Serrano, Bogufia and Vespig- 
nani [1], some of the nodes may be omitted from the back- 
bone, depending upon the significance level a employed, while 
the single strong component includes all A'^ nodes. By stop- 
ping the hierachical clustering before completion, however, 
in some suitable statistical manner, one might also isolate 
nodes.) An 0{M log N) algorithm, where M is the number of 
edges of the network-developed by R. E. Tarjan-exists for the 
hierarchical clustering. If the digraph of the original network 
is not strongly connected, the sub-networks corresponding to 
its individual strong components can each be independently 
analyzed using the two-stage algorithm [5]. 

The two-stage procedure has proved effective-to use the 
internal migration context-in recognizing regional structures 
(such as the islands of Japan [2] and French Louisiana |4]) 
and in distinguishing "cosmopolitan" geographic units (such 
as Paris [3] and U. S. "Sunbelt" counties l4])-those with broad 
(hub-like or centralized) ties-from "provincial" units 
Graph-theoretic tests of an ordinal nature can be applied to 
assess the significance of the clusters found [H sec. IV. B]. 
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